CLAIMS

1. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially- 
separated points within a sound field; 

digitally encoding a signal block to represent the first and second sound field signals 
as captured during a first time period; 

estimating the relative temporal delay between the first and second sound field signals 
within the approximate timeframe of the first time period; 

transmitting to a remote conferencing point, in packet format, both the encoded signal 
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein estimating the relative temporal delay further comprises tracking the
beginning and ending of a talkspurt represented in the sound field signals, and limiting the
variation of the estimated relative temporal delay during a talkspurt.

2. (Original) The method of claim 1, wherein digitally encoding a signal block comprises
combining the first and second sound field signals into a composite sound field signal by a
method selected from the group of methods consisting of: 

selecting one sound field signal as the source of the composite sound field signal and 
discarding the other sound field signal; 

summing the first and second sound field signals; and 

averaging the first and second sound field signals. 
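
As a non-limiting illustration, the three combining alternatives recited in claim 2 might be sketched as follows in Python. The function name `combine` and the `method` strings are assumptions introduced here for illustration; they do not appear in the claims.

```python
from typing import List, Sequence


def combine(first: Sequence[float], second: Sequence[float],
            method: str = "average") -> List[float]:
    """Combine two equal-length sample blocks into one composite block.

    `method` is a hypothetical selector: "select", "sum", or "average".
    """
    if len(first) != len(second):
        raise ValueError("blocks must have the same length")
    if method == "select":
        # Use the first signal as the source and discard the second.
        return list(first)
    if method == "sum":
        return [a + b for a, b in zip(first, second)]
    if method == "average":
        return [(a + b) / 2 for a, b in zip(first, second)]
    raise ValueError(f"unknown method: {method}")
```

Averaging keeps the composite within the amplitude range of the inputs, whereas summing may require headroom before encoding.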

3. (Original) The method of claim 1, wherein estimating the relative temporal delay
comprises:

calculating, for each of a plurality of relative time shifts, a first-to-second sound field 
signal cross-correlation coefficient; and 

selecting the relative temporal delay to correspond to the relative time shift generating 
the largest cross-correlation coefficient. 
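
A minimal sketch of the estimation recited in claim 3, assuming equal-length blocks of digital samples and a normalized correlation coefficient, might look like this. The names `estimate_delay` and `max_shift`, and the sign convention (a positive result means the second signal lags the first), are assumptions made for illustration.

```python
import math
from typing import Sequence


def correlation_coefficient(x: Sequence[float], y: Sequence[float]) -> float:
    """Normalized correlation of two equal-length sequences (0.0 if either is silent)."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0 else 0.0


def estimate_delay(first: Sequence[float], second: Sequence[float],
                   max_shift: int) -> int:
    """Return the shift, in samples, that maximizes the correlation coefficient."""
    n = len(first)
    best_shift, best_coeff = 0, -math.inf
    for shift in range(-max_shift, max_shift + 1):
        if shift >= 0:
            # Compare first[i] with second[i + shift].
            x, y = first[:n - shift], second[shift:]
        else:
            # Compare first[i - shift] with second[i].
            x, y = first[-shift:], second[:n + shift]
        coeff = correlation_coefficient(x, y)
        if coeff > best_coeff:
            best_shift, best_coeff = shift, coeff
    return best_shift
```

For example, with `second` equal to `first` delayed by three samples, `estimate_delay(first, second, 5)` selects a shift of 3.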

4. (Canceled) 



Docket No. 2705-103    Page 2 of 19    Application No. 09/614,535

5. (Original) The method of claim 1, wherein the relative temporal delay associated with
the first time period is estimated using substantially only the sound field signals captured 
during the first time period. 

6. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein estimating the relative temporal delay further comprises tracking the 
beginning and ending of a talkspurt represented in the sound field signals, wherein relative 
temporal delay associated with the first time period is estimated using substantially all of the 
sound field signals corresponding to the current talkspurt, up to and including at least a first 
portion of the first time period. 

7. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein estimating the relative temporal delay comprises detecting the beginning time
of a talkspurt in each of the sound field signals, and selecting the relative temporal delay for a 
talkspurt to correspond to the difference in beginning times detected for that talkspurt. 
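
By way of a hedged example, the onset-difference estimate of claim 7 could be sketched as below. A simple amplitude threshold stands in for whatever talkspurt detector is actually used; the `threshold` parameter and both function names are assumptions introduced here.

```python
from typing import Optional, Sequence


def talkspurt_onset(samples: Sequence[float], threshold: float) -> Optional[int]:
    """Index of the first sample whose magnitude reaches the threshold, else None."""
    for index, sample in enumerate(samples):
        if abs(sample) >= threshold:
            return index
    return None


def onset_delay(first: Sequence[float], second: Sequence[float],
                threshold: float) -> Optional[int]:
    """Delay of the second signal relative to the first, in samples.

    Returns None when no talkspurt beginning is found in one of the signals.
    """
    onset_first = talkspurt_onset(first, threshold)
    onset_second = talkspurt_onset(second, threshold)
    if onset_first is None or onset_second is None:
        return None
    return onset_second - onset_first
```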

8. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein the stereo decoding parameter expresses the estimated relative temporal delay 
between the first and second sound field signals as an integer number of digital sampling 
intervals. 

9. (Original) The method of claim 1, wherein the stereo decoding parameter expresses an
estimated angle of arrival based on the estimated relative temporal delay and the relative 
positioning of the first and second spatially-separated points. 
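
One common way to derive an angle of arrival from a delay and a known capture-point spacing is the far-field relation sin(theta) = c * tau / d. The sketch below assumes that relation, a speed of sound of 343 m/s, and a delay expressed in samples; none of these specifics come from the claims, and the function name is hypothetical.

```python
import math

SPEED_OF_SOUND_M_PER_S = 343.0  # assumption: air at roughly room temperature


def angle_of_arrival(delay_samples: float, sample_rate_hz: float,
                     spacing_m: float) -> float:
    """Estimated arrival angle in degrees, measured from broadside (0 = straight ahead)."""
    delay_s = delay_samples / sample_rate_hz
    ratio = SPEED_OF_SOUND_M_PER_S * delay_s / spacing_m
    # Clamp so that measurement noise cannot push asin() out of its domain.
    ratio = max(-1.0, min(1.0, ratio))
    return math.degrees(math.asin(ratio))
```

A zero delay maps to 0 degrees, and delays larger than the spacing physically allows are clamped to plus or minus 90 degrees.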

10. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein the stereo decoding parameter corresponding to the digitally-encoded signal
block representing the first time period is transmitted in the same packet as the
digitally-encoded signal block.
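
For illustration only, carrying the stereo decoding parameter in the same packet as the encoded block could be sketched with Python's standard `struct` module. The header layout used here (16-bit sequence number, signed 8-bit delay in samples, 16-bit payload length) is entirely hypothetical and is not a format described in the claims.

```python
import struct
from typing import Tuple

# Hypothetical header: sequence number, signed delay in samples, payload length.
# "!" selects network byte order with no padding, so the header is 5 bytes.
HEADER = struct.Struct("!HbH")


def pack_voice_packet(seq: int, delay_samples: int, encoded_block: bytes) -> bytes:
    """Place the delay parameter and the encoded signal block in one packet."""
    return HEADER.pack(seq, delay_samples, len(encoded_block)) + encoded_block


def unpack_voice_packet(packet: bytes) -> Tuple[int, int, bytes]:
    """Recover (sequence number, delay in samples, encoded block) from a packet."""
    seq, delay_samples, length = HEADER.unpack_from(packet)
    payload = packet[HEADER.size:HEADER.size + length]
    return seq, delay_samples, payload
```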

11. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein the stereo decoding parameter corresponding to the digitally-encoded signal 
block representing the first time period is transmitted in a later packet than the digitally- 
encoded signal block. 

12. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein the stereo decoding parameter corresponding to the digitally-encoded signal 
block representing the first time period is transmitted in a packet separate from any digitally- 
encoded signal block. 

13. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

wherein the stereo decoding parameter is transmitted once per talkspurt. 

14. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

estimating the signal energy present in each sound field signal during the approximate 
timeframe of the first time period, and transmitting to the remote conferencing endpoint, in
packet format, an explicit stereo balance parameter related to the relative signal energy in 
each sound field signal. 
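
A minimal sketch of a balance parameter derived from relative signal energy follows. Expressing the balance as the first signal's share of total energy (0.0 to 1.0) is an assumption chosen for illustration; the claim only requires a parameter related to the relative signal energy.

```python
from typing import Sequence


def block_energy(samples: Sequence[float]) -> float:
    """Sum of squared sample values over the block."""
    return sum(s * s for s in samples)


def stereo_balance(first: Sequence[float], second: Sequence[float]) -> float:
    """Share of the total energy found in the first signal (0.5 when both are silent)."""
    energy_first = block_energy(first)
    energy_second = block_energy(second)
    total = energy_first + energy_second
    return energy_first / total if total > 0 else 0.5
```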

15. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

estimating the signal energy present in a frequency subband of each sound field signal 
during the approximate timeframe of the first time period, and transmitting to the remote 
conferencing endpoint, in packet format, an explicit stereo balance parameter related to the 
relative signal energy in that subband for each sound field signal. 
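
The subband variant could be sketched, under stated assumptions, by summing squared DFT magnitudes over the bins that fall inside the band. The unwindowed direct DFT used here is a stand-in chosen for brevity, not a technique named in the claims, and the function name and band-edge parameters are hypothetical.

```python
import cmath
import math
from typing import Sequence


def subband_energy(samples: Sequence[float], sample_rate_hz: float,
                   low_hz: float, high_hz: float) -> float:
    """Energy of the DFT bins whose center frequency lies in [low_hz, high_hz)."""
    n = len(samples)
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate_hz / n
        if low_hz <= freq < high_hz:
            # Direct evaluation of one DFT coefficient.
            coeff = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                        for i, s in enumerate(samples))
            energy += abs(coeff) ** 2
    return energy
```

Computing this value for each sound field signal over the same band gives the two energies from which a per-subband balance parameter can be formed.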

16. (Currently Amended) A packet voice conferencing method comprising:

receiving concurrently-captured first and second sound field signals, the first and
second sound field signals representing a single sound field captured at two spatially-separated
points within a sound field;

digitally encoding a signal block to represent the first and second sound field signals
as captured during a first time period;

estimating the relative temporal delay between the first and second sound field signals
within the approximate timeframe of the first time period;

transmitting to a remote conferencing point, in packet format, both the encoded signal
block and a stereo decoding parameter based on the estimated relative temporal delay; and

establishing a packet-based control protocol with the remote conferencing point, and 
using the control protocol to inform the remote conferencing point that an encoder
performing the method of claim 1 is available for stereo packet voice conferencing.

17. (Currently Amended) An apparatus comprising a computer-readable medium
containing computer instructions that, when executed, cause a processor or multiple 
communicating processors to perform a method for packet voice conferencing, the method 
comprising: 

receiving concurrently-captured first and second voice sample streams, the first 
stream representing a first sound field signal captured at a first spatial location within a sound 
field, the second stream representing a second sound field signal captured at a second spatial 
location within the sound field; 

encoding a block of combined voice samples for the first and second voice sample
streams, the block representing voice samples captured during a first time period; 

estimating, using voice samples captured in the approximate timeframe of the first 
time period, an explicit relative temporal delay between the first and second sound field
signals; 

transmitting to a remote conferencing point, in packet format, both the encoded block 
of combined voice samples and a stereo decoding parameter including the estimated
relative temporal delay. 

18. (Original) The apparatus of claim 17, wherein encoding a block of combined voice
samples comprises combining voice samples for the first and second voice sample streams by 
a method selected from the group of methods consisting of: 

selecting one sample stream as the source of combined voice samples and discarding 
the other; 

summing a sample from the first stream and a sample from the second stream, the
samples representing substantially the same sample period; and 

averaging a sample from the first stream and a sample from the second stream, the 
samples representing substantially the same sample period. 

19. (Original) The apparatus of claim 17, wherein estimating the relative temporal delay 
comprises: 

calculating, for each of a plurality of sample index shift distances, a cross-correlation 
coefficient for a group of samples from one sample stream and a corresponding group of 
index-shifted samples from the other sample stream; and 

selecting the relative temporal delay to correspond to the sample index shift distance 
generating the largest cross-correlation coefficient. 

20. (Original) The apparatus of claim 19, wherein estimating the relative temporal delay 
further comprises tracking the beginning and ending of a talkspurt on the voice sample 
streams, and limiting the variation of the estimated relative temporal delay during a talkspurt. 
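
One way to limit the variation of the estimate during a talkspurt, offered here only as a sketch, is to clamp how far the reported delay may move per block and to reset when the talkspurt ends. The class name `DelayTracker` and the `max_step` parameter are assumptions introduced for illustration.

```python
from typing import Optional


class DelayTracker:
    """Holds the reported delay steady within a talkspurt (hypothetical helper)."""

    def __init__(self, max_step: int = 1) -> None:
        self.max_step = max_step              # largest change allowed per block
        self.current: Optional[int] = None    # None means no talkspurt in progress

    def update(self, raw_delay: int, in_talkspurt: bool) -> Optional[int]:
        if not in_talkspurt:
            # Talkspurt ended: forget the old value so the next one starts fresh.
            self.current = None
            return None
        if self.current is None:
            # Beginning of a talkspurt: accept the raw estimate as-is.
            self.current = raw_delay
        else:
            step = raw_delay - self.current
            step = max(-self.max_step, min(self.max_step, step))
            self.current += step
        return self.current
```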

21. (Previously presented) The apparatus of claim 19, wherein the group of samples from 
one sample stream comprise the samples captured during the first time period. 

22. (Original) The apparatus of claim 17, wherein estimating the relative temporal delay
further comprises tracking the beginning and ending of a talkspurt on the voice sample 
streams, wherein the group of samples from one sample stream comprise approximately all 
samples received within a current talkspurt, up to and including at least a first portion of the 
first time period, for that sample stream. 

23. (Original) The apparatus of claim 17, wherein estimating the relative temporal delay 
comprises detecting the beginning time of a talkspurt in each of the first and second sample 
streams, and selecting the relative temporal delay for a talkspurt to correspond to the 
difference in beginning times detected for that talkspurt. 

24. (Original) The apparatus of claim 17, wherein the stereo decoding parameter expresses 
the estimated relative temporal delay between the first and second sound field signals in 
samples. 

25. (Original) The apparatus of claim 17, wherein the stereo decoding parameter expresses 
an estimated angle of arrival based on the estimated relative temporal delay and the relative 
positioning of the first and second spatial locations. 

26. (Original) The apparatus of claim 17, wherein the stereo decoding parameter 
corresponding to the encoded block of voice samples captured during a first time period is 
transmitted in the same packet as those voice samples. 

27. (Original) The apparatus of claim 17, wherein the stereo decoding parameter
corresponding to the encoded block of voice samples captured during a first time period is 
transmitted in a later packet than those voice samples. 

28. (Original) The apparatus of claim 17, wherein the stereo decoding parameter 
corresponding to the encoded block of voice samples captured during a first time period is
transmitted in a packet containing no encoded block of voice samples. 

29. (Original) The apparatus of claim 17, wherein the stereo decoding parameter is
transmitted once per talkspurt. 

30. (Previously presented) The apparatus of claim 17, wherein the method further
comprises estimating, using voice samples captured in the approximate timeframe of the first 
time period, the signal energy in each sound field signal, and transmitting to the remote 
conferencing endpoint, in packet format, an explicit stereo balance figure related to the 
relative signal energy in each sound field signal. 

31. (Previously presented) The apparatus of claim 17, wherein the method further 
comprises estimating, using voice samples captured in the approximate timeframe of the first 
time period, the signal energy in a frequency subband of each sound field signal, and 
transmitting to the remote conferencing endpoint, in packet format, an explicit stereo balance
figure related to the relative signal energy in that subband for each sound field signal. 

32. (Currently Amended) A packet voice conferencing system comprising: 

means for receiving concurrently-captured first and second sound field signals, the
first and second sound field signals representing a single sound field captured at two
spatially-separated points within a sound field; 

means for encoding a digital data block to represent the combined first and second 
sound field signals captured within a first time period; 

means for estimating, using the first and second sound field signals as captured in the 
approximate timeframe of the first time period, an explicit relative temporal delay
between the first and second sound field signals; and 

means for encapsulating in a packet format the encoded digital data block and a stereo 
decoding parameter including the estimated relative temporal delay.

33. (Original) The packet voice conferencing system of claim 32, wherein the means for 
receiving comprises a first sample buffer to receive digital voice samples representing the 
first sound field signal, and a second sample buffer to receive digital voice samples 
representing the second sound field signal. 

34. (Original) The packet voice conferencing system of claim 32, wherein the means for 
receiving comprises a data link interface to receive digital voice samples from a remote 
conferencing endpoint. 

35. (Original) The packet voice conferencing system of claim 32, wherein the means for
encoding comprises: 

an adder to create a combined sound field signal by summing the first and second 
sound field signals; and 

an encoder to encode the combined sound field signal as created over an interval 
corresponding to the first time period, thereby encoding the digital data block.

36. (Original) The packet voice conferencing system of claim 32, wherein the means for 
estimating the relative temporal delay comprises a cross-correlator to correlate the first and 
second sound field signals for a plurality of relative time shifts. 

37. (Currently Amended) A packet voice conferencing system comprising: 

a sound field signal encoder to create a digitally-encoded signal block to represent 
both a first and a second sound field signal as captured within a first time period, the first and 
second sound field signals representing a single sound field captured at two spatially- 
separated points within a sound field; 

a stereo parameter estimator to estimate an explicit relative temporal delay
between the first sound field signal and the second sound field signal within the approximate 
timeframe of the first time period; and 

a packet formatter to encapsulate into at least one packet the digitally-encoded signal 
block and a stereo decoding parameter including the estimated relative temporal
delay.

38. (Original) The system of claim 37, further comprising a voice activity detector to detect 
when voice energy is represented in the first and second sound field signals, the voice activity 
detector supplying a voice activity detection signal to the packet formatter when voice 
activity is present, the packet formatter using the voice activity detection signal to inhibit 
packet generation when voice activity is not present. 
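
As a rough sketch of the gating behavior in claim 38, a mean-energy threshold can stand in for the voice activity detector, with silent blocks simply producing no packet. The threshold-based detector and both function names are assumptions; the claim does not specify how voice activity is detected.

```python
from typing import List, Sequence


def voice_active(block: Sequence[float], threshold: float) -> bool:
    """Hypothetical detector: mean sample energy at or above the threshold."""
    if not block:
        return False
    return sum(s * s for s in block) / len(block) >= threshold


def blocks_to_packetize(blocks: Sequence[Sequence[float]],
                        threshold: float) -> List[Sequence[float]]:
    """Keep only blocks with voice activity; packet generation is inhibited for the rest."""
    return [block for block in blocks if voice_active(block, threshold)]
```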

39. (Original) The system of claim 38, the voice activity detector supplying the voice activity 
detection signal to the stereo parameter estimator, the stereo parameter estimator using the 
voice activity detection signal as an enabling signal. 

40. (Original) The system of claim 38, the voice activity detector supplying the voice activity
detection signal to the stereo parameter estimator as first and second signal components, the 
first component representing voice activity detection for the first sound field signal and the 
second component representing voice activity detection for the second sound field signal, the 
stereo parameter estimator estimating the relative temporal delay using the temporal delay 
between voice activity detection in the first and second components. 

41. (Original) The system of claim 37, wherein the first and second sound field signals are 
digitally sampled, the system further comprising first and second sample buffers to 
respectively buffer digital samples for the first and second sound field signals and supply 
buffered samples to the stereo parameter estimator and sound field signal encoder. 

42. (Original) The system of claim 37, wherein the sound field signal encoder comprises an 
adder to create a combined sound field signal by summing the first and second sound field 
signals; and 

an encoder to encode the combined sound field signal as created over an interval 
corresponding to the first time period, thereby creating the digitally-encoded signal block.

43. (Original) The system of claim 37, wherein the stereo parameter estimator comprises a 
cross-correlator to compute a first-to-second sound field signal cross-correlation coefficient 
for a plurality of relative time shifts, the estimated temporal delay based on the relative time 
shift having the largest cross-correlation coefficient. 

44. (Previously presented) The system of claim 37, wherein the stereo decoding parameter 
comprises an explicit arrival angle based on the estimated temporal delay and a known 
configuration of the two spatially-separated points. 

45. (Previously presented) The system of claim 37, wherein the stereo parameter estimator 
further comprises a signal energy estimator to estimate the signal energy present in each of 
the first and second sound field signals in the approximate timeframe of the first time period, 
the packet formatter encapsulating an explicit stereo balance parameter related to the signal 
energy estimates. 

46. (Previously presented) The system of claim 37, wherein the stereo parameter estimator
further comprises a signal energy estimator to estimate the signal energy present in a 
frequency subband of each of the first and second sound field signals in the approximate 
timeframe of the first time period, the packet formatter encapsulating an explicit stereo
balance parameter related to the signal energy estimates. 

47. (Previously presented) A packet voice conferencing system comprising: 

a packet parser to receive voice packets received from a remote conferencing point, 
each voice packet containing at least one of an encoded signal block and a stereo decoding 
parameter, the stereo decoding parameter comprising at least one of an explicit delay 
parameter, an explicit balance parameter, and an explicit arrival angle parameter; 

a decoder to receive encoded signal blocks from the packet parser and decode those 
signal blocks to produce a voice sample stream; and 

a playout splitter coupled to the voice sample stream, the splitter using the stereo 
decoding parameter to create multiple output signal channels based on the voice sample 
stream. 

48. (Original) The packet voice conferencing system of claim 47, further comprising a jitter 
buffer inserted in the voice sample stream between the decoder and the playout splitter. 

49. (Previously presented) The packet voice conferencing system of claim 47, wherein the 
stereo decoding parameter comprises an explicit delay parameter, the splitter delaying 
playout of the voice sample stream on at least one output signal channel, relative to playout 
of the voice sample stream on another output signal channel, based on the value of the 
explicit delay parameter. 
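
On the receiving side, a delay-based playout split might be sketched as follows, assuming two output channels and a delay expressed in whole samples. The sign convention (positive delays the right channel, negative delays the left) and the function name are illustrative assumptions.

```python
from typing import List, Sequence, Tuple


def split_with_delay(mono: Sequence[float],
                     delay_samples: int) -> Tuple[List[float], List[float]]:
    """Return (left, right) channels, one of them delayed by |delay_samples|."""
    pad = [0.0] * abs(delay_samples)
    delayed = pad + list(mono)      # silence first, then the signal
    undelayed = list(mono) + pad    # padded at the end to keep lengths equal
    if delay_samples >= 0:
        return undelayed, delayed
    return delayed, undelayed
```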

50. (Previously presented) The packet voice conferencing system of claim 47, wherein the 
stereo decoding parameter comprises an explicit balance parameter, the splitter modifying the 
playout amplitude of the voice sample stream on at least one output signal channel, relative to 
the playout amplitude of the voice sample stream on another output signal channel, based on 
the value of the explicit balance parameter. 
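
A balance-based split can be sketched in the same spirit. Here the balance parameter is assumed to be a fraction between 0.0 and 1.0 giving the left channel's share, applied as a simple linear gain; both the encoding of the parameter and the gain law are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def split_with_balance(mono: Sequence[float],
                       balance: float) -> Tuple[List[float], List[float]]:
    """Return (left, right) channels scaled by balance and (1 - balance)."""
    if not 0.0 <= balance <= 1.0:
        raise ValueError("balance must lie between 0.0 and 1.0")
    left_gain = balance
    right_gain = 1.0 - balance
    left = [s * left_gain for s in mono]
    right = [s * right_gain for s in mono]
    return left, right
```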

51. (Original) The packet voice conferencing system of claim 50, wherein the playout
amplitude modification is audio-frequency dependent. 

52. (Original) The packet voice conferencing system of claim 47, further comprising a mixer
to mix the output signal channels with other signal channels derived from voice packets 
received from another remote conferencing point. 

53. (Original) The packet voice conferencing system of claim 52, further comprising a 
packet formatter to place the mixer output in packet format for transmission to a remote 
conferencing endpoint. 

54. (Previously presented) A packet voice conferencing system comprising: 

means for decoding encoded signal blocks to produce a voice sample stream, each 
encoded signal block received in packet format from a remote conferencing point; and 

means for splitting, based on the value of a stereo decoding parameter received in
packet format from a remote conferencing point, the voice sample stream into multiple output 
signal channels to produce a stereophonic effect, the stereo decoding parameter comprising at 
least one of an explicit delay parameter, an explicit balance parameter, and an explicit arrival 
angle parameter. 



55. (Previously presented) The packet voice conferencing system of claim 54, wherein the 
stereo decoding parameter comprises an explicit delay parameter, the means for splitting the 
voice sample stream comprising means for delaying playout of the voice sample stream on at 
least one output signal channel, relative to playout of the voice sample stream on another 
output signal channel, based on the value of the explicit delay parameter. 

56. (Previously presented) The packet voice conferencing system of claim 54, wherein the 
stereo decoding parameter comprises an explicit balance parameter, the means for splitting 
the voice sample stream comprising means for modifying the playout amplitude of the voice 
sample stream on at least one output signal channel, relative to the playout amplitude of the 
voice sample stream on another output signal channel, based on the value of the explicit 
balance parameter. 

57. (Previously presented) The packet voice conferencing system of claim 54, wherein the 
stereo decoding parameter comprises an explicit arrival angle parameter, the means for 
splitting the voice sample stream comprising means for calculating a delay parameter for at 
least one output signal channel to create the perception that the audio signal represented in
the voice sample stream is arriving at an angle corresponding to the explicit arrival angle
parameter.

58. (Previously presented) A packet voice conferencing method comprising: 

receiving, from a remote conferencing point, a voice packet stream, at least some 
voice packets in the stream carrying a payload comprising an encoded signal block, at least 
some voice packets in the stream carrying a payload comprising a stereo decoding parameter, 
the stereo decoding parameter comprising at least one of an explicit delay parameter, an 
explicit balance parameter, and an explicit arrival angle parameter; 

decoding the encoded signal blocks to produce a voice sample stream;

splitting the voice sample stream into multiple output signal channels; and

manipulating the signal carried on at least one of the output signal channels based on
the value of the stereo decoding parameter to create a stereophonic effect on the output signal 
channels.

59. (Previously presented) The method of claim 58, wherein the stereo decoding parameter 
comprises an explicit delay parameter, and wherein manipulating the signal carried on at least 
one of the output signal channels comprises delaying playout of the voice sample stream on 
at least one output signal channel, relative to playout of the voice sample stream on another 
output signal channel, based on the value of the explicit delay parameter. 

60. (Previously presented) The method of claim 58, wherein the stereo decoding parameter
comprises an explicit balance parameter, and wherein manipulating the signal carried on at 
least one of the output signal channels comprises modifying the playout amplitude of the 
voice sample stream on at least one output signal channel, relative to the playout amplitude of 
the voice sample stream on another output signal channel, based on the value of the explicit 
balance parameter. 

61. (Previously presented) The method of claim 58, wherein the stereo decoding parameter 
comprises an explicit arrival angle parameter, and wherein manipulating the signal carried on 
at least one of the output signal channels comprises calculating a delay parameter for at least 
one output signal channel to create the perception that the audio signal represented in the 
voice sample stream is arriving at an angle corresponding to the explicit arrival angle
parameter. 
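
Calculating a channel delay from an arrival angle can be sketched by inverting the far-field relation, delay = d * sin(theta) / c. The virtual spacing `spacing_m`, the 343 m/s speed of sound, and the rounding to whole samples are assumptions chosen for this example rather than details taken from the claims.

```python
import math


def delay_from_angle(angle_deg: float, sample_rate_hz: float,
                     spacing_m: float, speed_m_per_s: float = 343.0) -> int:
    """Inter-channel playout delay, in whole samples, for a given arrival angle."""
    delay_s = spacing_m * math.sin(math.radians(angle_deg)) / speed_m_per_s
    return round(delay_s * sample_rate_hz)
```

With an assumed 0.2 m spacing at 8000 samples per second, an angle of 90 degrees corresponds to about 4.66 sample periods, which rounds to 5.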

62. (Previously presented) An apparatus comprising a computer-readable medium 
containing computer instructions that, when executed, cause a processor or multiple 
communicating processors to perform a method for packet voice conferencing, the method 
comprising: 

receiving, from a remote conferencing point, a voice packet stream, at least some 
voice packets in the stream carrying a payload comprising an encoded signal block, at least 
some voice packets in the stream carrying a payload comprising a stereo decoding parameter, 
the stereo decoding parameter comprising at least one of an explicit delay parameter, an 
explicit balance parameter, and an explicit arrival angle parameter;

decoding the encoded signal blocks to produce a voice sample stream; 

splitting the voice sample stream into multiple output signal channels; and 

manipulating the signal carried on at least one of the output signal channels based on 
the value of the stereo decoding parameter to create a stereophonic effect on the output signal 
channels. 

63. (Previously presented) The apparatus of claim 62, wherein the stereo decoding 
parameter comprises an explicit delay parameter, and wherein manipulating the signal carried 
on at least one of the output signal channels comprises delaying playout of the voice sample 
stream on at least one output signal channel, relative to playout of the voice sample stream on 
another output signal channel, based on the value of the explicit delay parameter.

64. (Previously presented) The apparatus of claim 62, wherein the stereo decoding 
parameter comprises an explicit balance parameter, and wherein manipulating the signal 
carried on at least one of the output signal channels comprises modifying the playout 
amplitude of the voice sample stream on at least one output signal channel, relative to the 
playout amplitude of the voice sample stream on another output signal channel, based on the 
value of the explicit balance parameter. 

65. (Previously presented) The apparatus of claim 62, wherein the stereo decoding 
parameter comprises an explicit arrival angle parameter, and wherein manipulating the signal 
carried on at least one of the output signal channels comprises calculating a delay parameter 
for at least one output signal channel to create the perception that the audio signal represented
in the voice sample stream is arriving at an angle corresponding to the explicit arrival angle
parameter.